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RAW SEQUENCE LISTING DATE: 11/28/2001 53 fTJ 

PATENT APPLICATION: US/09/841, 805A TIME: 11:32:06 — - 00 

Input Set : N:\Crf3\RULE60\09841805a.txt 2 FT] 

Output Set: N:\CRF3\11282001\I841805A.raw <§ , ("H 

SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Lai, Preeti 

6 Shah, Purvi 

7 Corley, Neil C. 

9 (ii) TITLE OF INVENTION: HUMAN MEMBRANE RECYCLING 

10 PROTEINS 

12 (iii) NUMBER OF SEQUENCES: 5 

14 (iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

16 (B) STREET: 3174 Porter Dr. 

17 (C) CITY: Palo Alto 

18 (D) STATE: CA 

19 (E) COUNTRY: USA 

20 (F) ZIP: 94304 

22 (V) COMPUTER READABLE FORM: 

23 (A) MEDIUM TYPE: Diskette 

24 (B) COMPUTER: IBM Compatible 

25 (C) OPERATING SYSTEM: DOS 

26 (D) SOFTWARE: FastSEQ for Windows Version 2.0 
28 (vi) CURRENT APPLICATION DATA: 

C--> 29 (A) APPLICATION NUMBER: US/09/84 1 f 805A 

C--> 30 (B) FILING DATE: 24-Apr-2001 

31 (C) CLASSIFICATION: 

33 (vii) PRIOR APPLICATION DATA: 

34 (A) APPLICATION NUMBER: 09/360,125 

35 (B) FILING DATE: 1999-07-23 

37 (viii) ATTORNEY/AGENT INFORMATION: 

38 (A) NAME: Billings, Lucy J. 

39 (B) REGISTRATION NUMBER: 36,749 

40 (C) REFERENCE/DOCKET NUMBER: PF-0456 US 

42 (ix) TELECOMMUNICATION INFORMATION: 

43 (A) TELEPHONE: 650-855-0555 

44 (B) TELEFAX: 650-845-4166 
47 (2) INFORMATION FOR SEQ ID NO: 1: 

49 (i) SEQUENCE CHARACTERISTICS: 

50 (A) LENGTH: 347 amino acids 

51 (B) TYPE: amino acid 

52 (C) STRANDEDNESS : single 

53 (D) TOPOLOGY: linear 

55 (vii) IMMEDIATE SOURCE: 

56 (A) LIBRARY: TONGTUT01 

57 (B) CLONE: 980615 

59 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

61 Met Ala Gin Ser Arg Asp Gly Gly Asn Pro Phe Ala Glu Pro Ser Glu 

62 1 5 10 15 

63 Leu Asp Asn Pro Phe Gin Asp Pro Ala Val lie Gin His Arg Pro Ser 
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100 305 310 315 320 

101 Gly Val Phe Ser Asn Pro Ala Val Arg Thr Ala Ala Ala- Asn Ala Ala 

102 325 330 335 

103 Ala Gly Ala Ala Glu Asn Ala Phe Arg Ala Pro 

104 340 345 
106 (2) INFORMATION FOR SEQ ID NO : 2: 

108 (i) SEQUENCE CHARACTERISTICS: 

109 (A) LENGTH: 1521 base pairs 

110 (B) TYPE: nucleic acid 

111 (C) STRANDEDNESS : single 

112 (D) TOPOLOGY: linear 

114 (vii) IMMEDIATE SOURCE: 

115 (A) LIBRARY: TONGTUT01 
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116 (B) CLONE: 980615 

118 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

120 NGACGCAGGC GCAACCCACG GCTGCTGCGG GGATCCTTGT 

121 CCAATCCGTG CACAGAGAAG CGGGGCGAAC TGAGGCGAGT 

122 CCGCTACCGC CACTGCTGCG GCAGGGGCGT GGAGGGCAGA 

123 GCAAACATGG CTCAGAGCAG AGACGGCGGA AACCCGTTCG 

124 AACCCCTTTC AGGACCCAGC TGTGATCCAG CACCGACCCA 

125 GACGTCTACA ACCCTTTTGA GACCCGGGAG CCACCACCAG 

126 GCCCCATTGC CTCCACCCTC AGCTCCCTCC TTGCAGCCCT 

127 GAACCTAAGA ACTATGGCTC ATACAGCACT CAGGCCTCAG 

128 CTGCTGAAGA AACAGGAGGA GCTCAACCGG AAGGCAGAGG 

129 GAGCTGCAGC ATGCTGCCCT GGGGGGCACA GCTACTCGAC 

130 CCTTCTTTTT GTCCAGTTCA GCCCTGCTTT TTCCAGGACA 

131 GAATTTCAGA AGACTGTATC CACCATGTAC TACCTCTGGA 

132 CTCCTGAACT TCCTCGCCTG CCTGGCCAGC TTCTGTGTGG 

133 TTTGGGCTTT CTATCCTCTG GGTCCTCCTT TTCACTCCCT 

134 CGCCCCATGT ATAAGGCTTT CCGGAGTGAC AGTTCATTCA 

135 ATTTTCTTCG TCCAGGATGT GCTCTTTGTC CTCCAGGCCA 

136 TTCAGTGGCT GGATCTCTGC TCTGGTGGTG CCGAAGGGCA 

137 ATGCTGCTGG TCGCCCTGCT CTTCACTGGC ATTGCTGTGC 

138 CGGATCCACT CCTTATACCG CCGCACAGGT GCCAGCTTTC 

139 GCTGCTGGTG TCTTCTCCAA CCCTGCGGTG CGAACCGCAG 

140 GCTGCTGAAA ATGCCTTCCG GGCCCCGTGA CCCCTGACTG 

141 TTGAGGGAGC TGACTTAGCT CCCGTCCCTA AGGTCTCTGG 

142 ACTGATGGCT CCTCCGTAGT GCTCCCAATC CTATGGCCAT 

143 GTGTGGGGAG TTCACTGTGA CCTAGTCCCC CCATCAGGCC 

144 ACGCCCCAAC CCAGCTTCCC TCTGCTGTGC CACGGCTGTT 

145 AAGAAAGTGG AACTGGAACT G 

147 (2) INFORMATION FOR SEQ ID NO : 3: 

149 (i) SEQUENCE CHARACTERISTICS: 

150 (A) LENGTH: 329 amino acids 

151 (B) TYPE: amino acid 

152 (C) STRANDEDNESS: single 

153 (D) TOPOLOGY: linear 

155 (vii) IMMEDIATE SOURCE: 

156 (A) LIBRARY: BRSTNOT01 

157 (B) CLONE: 412453 

159 (xi) SEQUENCE DESCRIPTION :_ _SE.Q ID NO: 3:. 

161 Met Ser Ala Phe Asp Thr Asn Pro Phe Ala Asp 

162 15 10 

163 Pro Phe Gin Asp Pro Ser Val Thr Gin Leu Thr 

164 20 25 

165 Gly Leu Ala Glu Phe Asn Pro Phe Ser Glu Thr 

166 35 40 

167 Val Pro Val Thr Gin Leu Pro Gly Ser Ser Gin 

168 50 55 

169 Pro Ser Val Glu Pro Thr Gin Pro Thr Pro Gin 

170 65 70 75 

171 Ala Gin Ala Gly Leu Leu Arg Gin Gin Glu Glu 
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GCTGGCTCTT 
TGGCGCAGGC 
CTGCTGGTAC 
TTTCTTCTTC 
AGGTTGGGGA 
ATCCGTGCTC 
CATGCTGAAA 
GCAAGAATTT 
AGCCGCTGGG 
GCCCTGCTAC 
GACATCACTA 
CCTGACAGGC 
CACCTCTCAC 
TTTAAATAAA 
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173 Ala Glu Leu Glu Arg Lys Glu Arg Glu Leu Gin Asn Thr Val Ala Asn 

174 100 105 110 

175 Leu His Val Arg Gin Asn Asn Trp Pro Pro Leu Pro Ser Trp Cys Pro 

176 115 120 125 

177 Val Lys Pro Cys Phe Tyr Gin Asp Phe Ser Thr Glu lie Pro Ala Asp 

178 130 135 140 

179 Tyr Gin Arg lie Cys Lys Met Leu Tyr Tyr Leu Trp Met Leu His Ser 

180 145 150 155 160 

181 Val Thr Leu Phe Leu Asn Leu Leu Ala Cys Leu Ala Trp Phe Ser Gly 

182 165 170 175 

183 Asn Ser Ser Lys Gly Val Asp Phe Gly Leu Ser lie Leu Trp Phe Leu 

184 180 185 190 

185 lie Phe Thr Pro Cys Ala Phe Leu Cys Trp Tyr Arg Pro lie Tyr Lys 

186 195 200 205 

187 Ala Phe Arg Ser Asp Asn Ser Phe Ser Phe Phe Val Phe Phe Phe Val 

188 210 215 220 

189 Phe Phe Cys Gin lie Gly lie Tyr lie lie Gin Leu Val Gly lie Pro 

190 225 230 235 240 

191 Gly Leu Gly Asp Ser Gly Trp lie Ala Ala Leu Ser Thr Leu Asp Asn 

192 245 250 255 

193 His Ser Leu Ala He Ser Val He Met Met Val Val Ala Gly Phe Phe 

194 260 265 270 

195 Thr Leu Cys Ala Val Leu Ser Val Phe Leu Leu Gin Arg Val His Ser 

196 275 280 285 

197 Leu Tyr Arg Arg Thr Gly Ala Ser Phe Gin Gin Ala Gin Glu Glu Phe 

198 290 295 300 

199 Ser Gin Gly He Phe Ser Ser Arg Thr Phe His Arg Ala Ala Ser Ser 

200 305 310 315 320 

201 Ala Ala Gin Gly Ala Phe Gin Gly Asn 

202 325 

204 (2) INFORMATION FOR SEQ ID NO : 4: 

206 (i) SEQUENCE CHARACTERISTICS: 

207 (A) LENGTH: 2434 base pairs 

208 (B) TYPE: nucleic acid 

209 (C) STRANDEDNESS : single 

210 (D) TOPOLOGY: linear 

212 (vii) IMMEDIATE SOURCE: 

213 (A) LIBRARY: BRSTNOT01 

214 (B) CLONE: 412453 

216 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

218 NCCGGAAGTG GAGGGTCTAC ACGAAGCGCC GCTGGGTCTG GGTGCCCGGA GGCAGCAGCG 

219 TTCGCGGAGT TCGCCCGCTG GCCCCCGATC ACCATGTCGG CTTTCGACAC CAACCCCTTC 

220 GCGGACCCAG TGGATGTAAA CCCCTTCCAG GATCCCTCTG TGACCCAGCT GACCAACGCC 

221 CCGCAGGGCG GCCTGGCGGA ATTCAACCCC TTCTCAGAGA CAAATGCAGC GACAACAGTT 

222 CCTGTCACCC AACTCCCTGG GTCCTCACAG CCAGCGGTTC TCCAGCCATC AGTGGAACCA 
22 3 ACCCAGCCGA CCCCCCAGGC CGTGGTGTCT GCAGCCCAGG CAGGCCTGCT CCGGCAGCAG 

224 GAAGAACTGG ACAGGAAAGC TGCCGAGCTG GAACGCAAGG AGCGGGAGCT GCAGAACACT 

225 GTAGCCAACT TGCATGTGAG ACAGAACAAC TGGCCCCCTC TGCCCTCGTG GTGCCCTGTG 
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226 AAGCCCTGCT TCTATCAGGA TTTCTCCACA GAGATCCCTG CCGACTACCA GCGGATATGC 

227 AAGATGCTCT ACTATCTGTG GATGTTGCAT TCAGTGACTC TGTTTCTGAA CCTGCTTGCC 

228 TGCCTGGCCT GGTTCTCGGG CAACAGCTCC AAGGGAGTGG ACTTTGGCCT CTCCATCCTG 

229 TGGTTTCTGA TCTTCACTCC CTGTGCCTTC CTTTGTTGGT ACCGACCCAT CTATAAGGCC 

230 TTTAGGTCCG ACAACTCTTT CAGCTTCTTT GTGTTCTTCT TTGTATTTTT TTGTCAAATA 

231 GGGATCTACA TCATCCAGTT GGTTGGCATC CCTGGCCTGG GGGACAGCGG TTGGATTGCA 

232 GCCCTGTCTA CACTGGATAA TCATTCCCTG GCCATATCAG TCATCATGAT GGTGGTGGCT 

233 GGCTTCTTCA CCCTCTGTGC CGTGCTCTCA GTCTTCCTCC TGCAGCGGGT GCACTCCCTC 

234 TACCGACGGA CAGGGGCCAG CTTCCAGCAG GCCCAGGAGG AGTTTTCCCA GGGCATCTTC 

235 AGCAGCAGAA CCTTCCACAG AGCTGCTTCA TCTGCTGCCC AAGGAGCCTT CCAGGGGAAT 

236 TAGTCCTCCT CTCTTCTCTC CCCCTCAGCC TTTCTCTCGC CTGCCTTCTG AGCTGCACTT 

237 TCCGTGGGTG CCTTATGTGG TGGTGGTTGT GCCCAGCACA GACCTGGCAG GGTTCTTGCC 

238 GTGGCTCTTC CTCCTCCCTC AGCGACCAGC TCTCCCTGGA ACGGGAGGGA CAGGGAATTT 

239 TTTCCCCCTC TATGTACAAA AAAAAACAAA GCTCTCTTTC CTTCTCTGGT GATGGTTTGG 

240 TAGGATTCTT TTGTCTCTGG AAGCAGTGGG ACTGAAGTTC TCTTCGTCCT GTGCACACAC 

241 AGACACCCCC ACACAGTTGG GATCACAGGC TGACCTGGGC CCATCCCAGC TGGAGCTTTC 

242 TGCCAGGGTC CTGGGCCTTG ACTCCCCCAC CCTGCAGGCC TGGCCTGAAT CTGGCTTCTT 

243 AGACACAGCC CAGTCCTTCC TGCCTGGGCT GGGAATAAGC CTCTCACAGG TTCTGGTGGA 

244 CAGATCTGTT CCCCAGGTCA CTCCAGTGGT CTCCAGGCTT CCAGAGAAGG CTGGTTGCCT 

245 CAAGCTCTTC TCTGCCTCAT AAACGGATCC AGAGAAGGCT GGTTGCCTTA AGCTCTTCCC 

246 TGCCTCGTGT TCCTGAGAAA CGGATTAATA GCCCTTTATC CCCCTGCACC CTCCTGCAGG 

247 GGATGGCACT TTGAGCCCTC TGGAGCCCTC CCCTTGCTGA GCCTTACTCT CTTCAGACTT 

248 TCTGAATGTA CAGTGCCGTT GGTTGGGATT TGGGGACTGG AAGGGACCAA GGACACTGAC 

249 CCCAAGCTGT CCTGCCTAGC GTCCAGCGTC TTCTAGGAGG GTGGGGTCTG CCTGTCCTGG 

250 TGTGGTTGGT TTGGCCCTGT TTGCTGTGAC TACCCCCCCC CCTCCCCGAA CCGAGGGACG 

251 GCTGCCTTTG TCTCTGCCTC AGATGCCACC TGCCCCGCCC ATGCTCCCCA TCAGCAGCAT 

252 CCAGACTTTC AGGAAGGGCA GGACCAGCCA GTCCAGAACC GCATCCCTCA GCAGGGACTG 

253 ATAAGCCATC TCTCGGAGGG CCCCCTAATA CCCAGTGGAG TCTGGTTCAC ACCCTGGGGG 

254 GTGTGTCACT GTGATGGGAC ACGTAGGAGT CCACCCTTAA AACCAGCACC CTGTCCCTCG 

255 AGGCTGCCGA GTGGGTGTGT GGACTGGGGT GCCTTCCCAC AAAACTAGCC TCCGGCTCTG 

256 GGCCCGAGAC AGCCGCAGGC CCCAGCCACT GAATGATACT GGCAGCGGCT GGGGTTTTAT 

257 GAACTCCTTT CTGGTATTTT TTCCCCTCTA TGTACAAATG TATATGTTAC GTCTCAATTT 

258 TTGTGCTTAA GTAAAAATAA AAACATTTTC AGAC 
260 (2) INFORMATION FOR SEQ ID NO: 5: 

262 (i) SEQUENCE CHARACTERISTICS: 

263 (A) LENGTH: 338 amino acids 

264 (B) TYPE: amino acid 

265 (C) STRANDEDNESS: single . .. . 

266 (D) TOPOLOGY: linear 

268 (vii) IMMEDIATE SOURCE: 

269 (A) LIBRARY: GenBank 

270 (B) CLONE: 487057 

272 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

274 Met Ser Asp Phe Asp Ser Asn Pro Phe Ala Asp Pro Asp Leu Asn Asn 

275 15 10 15 

276 Pro Phe Lys Asp Pro Ser Val Thr Gin Val Thr Arg Asn Val Pro Pro 

277 20 25 30 

278 Gly Leu Asp Glu Tyr Asn Pro Phe Ser Asp Ser Arg Thr Pro Pro Pro 

279 35 40 45 
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